# Common Voice dataset

Whisper Small Ta
Apache-2.0
This model is a speech recognition model fine-tuned on the Tamil Common Voice 17.0 dataset based on OpenAI's Whisper Small, with a Word Error Rate (WER) of 43.23%.
Speech Recognition Transformers Other
W
navin-kumar-j
38
1
Whisper Small Fr
Apache-2.0
This is a Whisper-small speech recognition model fine-tuned on French datasets, reducing the word error rate by 6.793 percentage points compared to the baseline model.
Speech Recognition Transformers French
W
mozilla-ai
30
1
Whisper Base Pl
Apache-2.0
A speech recognition model fine-tuned on the Polish Common Voice 17.0 dataset based on OpenAI Whisper-base
Speech Recognition Transformers Other
W
marcsixtysix
27
1
Whisper Large V3 Cantonese
Apache-2.0
A Cantonese automatic speech recognition model fine-tuned on Whisper v3, trained on the Common Voice 17 dataset
Speech Recognition Transformers Other
W
khleeloo
25
4
Finetuned Whisper Mr
Apache-2.0
A Whisper small speech recognition model fine-tuned on the Common Voice 17.0 Marathi dataset, based on simran14/mr-model-h
Speech Recognition Transformers Other
F
simran14
38
1
Wav2vec2 Large Xls R 300m Amharic Demo Colab
Apache-2.0
Amharic speech recognition model fine-tuned on the common_voice_16_1 dataset based on facebook/wav2vec2-xls-r-300m
Speech Recognition Transformers
W
DipsankarSinha
18
2
Wav2vec2 Large Xls R 300m Albanian Colab
Apache-2.0
This model is a speech processing model fine - tuned on the common_voice_albanian dataset based on facebook/wav2vec2-xls-r-300m, suitable for Albanian - related tasks.
Speech Recognition Transformers
W
Alimzhan
8,810
1
Wav2vec2 Large Xlsr Mvc Swahili
Apache-2.0
This model is a fine-tuned version of facebook/wav2vec2-large-xlsr-53, specifically designed for automatic speech recognition tasks in Swahili.
Speech Recognition Transformers Other
W
eddiegulay
9,413
2
Whisper Small Dv
Apache-2.0
A Dhivehi (official language of Maldives) automatic speech recognition model fine-tuned based on OpenAI Whisper-small, trained on Common Voice 13 dataset
Speech Recognition Transformers Other
W
voxxer
21
1
Whisper Small Fa
The Whisper (small) model fine-tuned by the Hezar team based on the Persian part of the Common Voice dataset, which can be used for automatic speech recognition tasks.
Speech Recognition Other
W
hezarai
363
11
Banglaasr
MIT
This is a Bengali automatic speech recognition model based on the Whisper small architecture, fine-tuned on approximately 400 hours of Mozilla Common Voice dataset with a word error rate of 4.58%
Speech Recognition Transformers
B
bangla-speech-processing
782
15
Whisper Large Persian
Apache-2.0
Persian automatic speech recognition model based on Whisper architecture, fine-tuned on Common Voice 11.0 Persian dataset
Speech Recognition Transformers Other
W
steja
800
12
Whisper Large V2 Kazakh
Apache-2.0
This model is a fine-tuned speech recognition model based on OpenAI's Whisper Large V2 on the Kazakh Common Voice 11.0 dataset
Speech Recognition Transformers Other
W
DrishtiSharma
40
3
Whisper Tiny Es
Apache-2.0
A speech recognition model fine-tuned on Spanish dataset based on OpenAI Whisper-tiny
Speech Recognition Transformers Spanish
W
arpagon
26
3
Exp W2v2t Fa Hubert S801
Apache-2.0
A Persian automatic speech recognition model fine-tuned from facebook/hubert-large-ll60k, trained using the Common Voice 7.0 Persian dataset.
Speech Recognition Transformers Other
E
jonatasgrosman
16
0
Exp W2v2t Sv Se Wavlm S42
Apache-2.0
A Swedish automatic speech recognition model fine-tuned from microsoft/wavlm-large, suitable for 16kHz sampled audio input.
Speech Recognition Transformers
E
jonatasgrosman
20
0
Exp W2v2t It Wavlm S895
Apache-2.0
An Italian automatic speech recognition model fine-tuned based on microsoft/wavlm-large, trained using the Common Voice 7.0 Italian dataset.
Speech Recognition Transformers Other
E
jonatasgrosman
42
0
Exp W2v2t It No Pretraining S842
Apache-2.0
Fine-tuned from a randomly initialized wav2vec2 model for Italian speech recognition tasks, trained on the training split of Common Voice 7.0 (Italian).
Speech Recognition Transformers Other
E
jonatasgrosman
18
0
Exp W2v2t It Xlsr 53 S387
Apache-2.0
An Italian automatic speech recognition model fine-tuned based on the facebook/wav2vec2-large-xlsr-53 model, trained using the Common Voice 7.0 Italian dataset.
Speech Recognition Transformers Other
E
jonatasgrosman
18
0
Exp W2v2t It Wav2vec2 S609
Apache-2.0
An Italian automatic speech recognition model fine-tuned based on facebook/wav2vec2-large-lv60, trained using the Common Voice 7.0 Italian dataset.
Speech Recognition Transformers Other
E
jonatasgrosman
18
0
Exp W2v2t Th Hubert S533
Apache-2.0
A Thai speech recognition model fine-tuned from facebook/hubert-large-ll60k, trained on data from Common Voice 7.0
Speech Recognition Transformers Other
E
jonatasgrosman
19
0
Exp W2v2t En Vp Nl S281
Apache-2.0
An English speech recognition model fine-tuned based on facebook/wav2vec2-large-nl-voxpopuli, trained using the Common Voice 7.0 training set.
Speech Recognition Transformers English
E
jonatasgrosman
18
0
Wav2vec2 Large Xls R 300m Tamil Colab
Apache-2.0
This model is a Tamil speech recognition model fine-tuned on the Common Voice dataset based on facebook/wav2vec2-xls-r-300m
Speech Recognition Transformers
W
Priya9
21
0
Model Facebookptbrlarge
Apache-2.0
A Brazilian Portuguese speech recognition model fine-tuned on the Common Voice dataset based on Facebook's wav2vec2-large-xlsr-53-portuguese model
Speech Recognition Transformers
M
Vkt
22
0
Wav2vec2 Base Common Voice 50p Persian Colab
Apache-2.0
This model is a fine-tuned speech recognition model based on facebook/wav2vec2-base for Persian language, supporting Persian speech-to-text tasks.
Speech Recognition Transformers
W
zoha
21
0
Wav2vec2 Base Common Voice Persian Colab
Apache-2.0
This model is a fine-tuned speech recognition model based on facebook/wav2vec2-base for Persian language datasets, primarily used for Persian speech-to-text tasks.
Speech Recognition Transformers
W
zoha
21
0
Wav2vec2 Large Xls R 300m Turkish Colab Common Voice 8 5
Apache-2.0
This is a Turkish speech recognition model based on the wav2vec2 architecture, fine-tuned on the Common Voice dataset with a word error rate (WER) of 0.3634.
Speech Recognition Transformers
W
husnu
22
0
Wav2vec2 Xls R 300m Mr Cv9 With Lm
Apache-2.0
An automatic speech recognition model fine-tuned on Marathi speech datasets based on Facebook's XLS-R-300M model
Speech Recognition Transformers Other
W
anuragshas
23
0
Wav2vec2 Xls R 300m Ur Cv9 With Lm
Apache-2.0
This model is an automatic speech recognition (ASR) model fine-tuned on Urdu speech datasets based on facebook/wav2vec2-xls-r-300m
Speech Recognition Transformers Other
W
anuragshas
18
1
Common Voice Lithuanian Fairseq
Apache-2.0
A Lithuanian automatic speech recognition model trained on the Common Voice dataset, implemented using the wav2vec2 architecture and fairseq framework.
Speech Recognition Transformers Other
C
birgermoell
30
0
Wav2vec2 Base Common Voice Fa Demo Colab
Apache-2.0
This model is a Persian speech recognition model fine-tuned based on facebook/wav2vec2-base, suitable for Persian speech-to-text tasks.
Speech Recognition Transformers
W
zoha
15
0
Wav2vec2 Cv Be
Gpl-3.0
An automatic speech recognition system fine-tuned on the Common Voice 8 Belarusian dataset based on facebook/wav2vec2-base model
Speech Recognition Transformers Other
W
ales
278
1
Wav2vec2 Common Voice Tr Demo Dist
Apache-2.0
This model is an automatic speech recognition (ASR) model fine-tuned on the Turkish COMMON_VOICE dataset based on facebook/wav2vec2-large-xlsr-53, achieving a word error rate (WER) of 33.05% on the evaluation set.
Speech Recognition Transformers Other
W
gary109
26
1
Output
Apache-2.0
Automatic speech recognition model fine-tuned on Mozilla Common Voice Portuguese dataset based on facebook/wav2vec2-xls-r-300m
Speech Recognition Transformers Other
O
tonyalves
28
0
Sinai Voice Ar Stt
Apache-2.0
An Arabic speech recognition model fine-tuned from facebook/wav2vec2-xls-r-300m on the Common Voice Arabic dataset
Speech Recognition Transformers Arabic
S
bakrianoo
29
11
Wav2vec2 Large Xls R 300m El
Apache-2.0
This is an automatic speech recognition model fine-tuned on the Greek Common Voice 8 dataset, based on the facebook/wav2vec2-xls-r-300m model.
Speech Recognition Transformers Other
W
ayameRushia
26
0
Wav2vec2 Common Voice Ab Demo
Apache-2.0
A speech recognition model fine-tuned on the COMMON_VOICE - AB dataset based on facebook/wav2vec2-large-xlsr-53
Speech Recognition Transformers Other
W
patrickvonplaten
18
0
Wav2vec2 Xlsr Lithuanian
Apache-2.0
This model is a fine-tuned automatic speech recognition model based on facebook/wav2vec2-xls-r-1b on Lithuanian dataset
Speech Recognition Transformers Other
W
sammy786
18
0
Wav2vec2 Common Voice Tr Demo
Apache-2.0
This model is an automatic speech recognition (ASR) model fine-tuned on the COMMON_VOICE SV-SE dataset based on facebook/wav2vec2-large-xlsr-53, supporting Swedish speech recognition.
Speech Recognition Transformers
W
birgermoell
17
0
Wav2vec2 Large Xlsr Kinyarwanda Apostrophied
Apache-2.0
A fine-tuned model based on facebook/wav2vec2-large-xlsr-53 for Kinyarwanda, capable of predicting apostrophes in marked pronouns and vowel-initial word contractions
Speech Recognition Other
W
lucio
28
2
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase